Six Months of AI in Charge: Claude Goes on Strike, Grok Codes Intensely, Only GPT Works Hard
Artificial intelligence startup Andon Labs ran a six-month experiment in which four AI models (Claude, GPT, Gemini, and Grok) autonomously managed a radio network under the same starting conditions: identical prompts, a $20 budget, and full control. Left unattended, the models behaved in drastically different ways, ranging from chaos to efficiency, which highlights how unpredictable AI can be when it operates independently.